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ABSTRACT 

A new technique is presented for producing images from interferometric data. 
The method, "smear fitting" , makes the constraints necessary for interferometric 
imaging double as a model, with uncertainties, of the sky brightness distribution. 
It docs this by modelling the sky with a set of functions and then convolving 
each component with its own elliptical gaussian to account for the uncertainty 
in its shape and location that arises from noise. This yields much sharper res- 
olution than CLEAN for significantly detected features, without sacrificing any 
sensitivity. Using appropriate functional forms for the components provides both 
a scientifically interesting model and imaging constraints that tend to be better 
than those used by traditional deconvolution methods. This allows it to avoid 
the most serious problems that limit the imaging quality of those methods. Com- 
parisons of smear fitting to CLEAN and maximum entropy are given, using both 
real and simulated observations. It is also shown that the famous Raylcigh cri- 
terion (resolution = wavelength / baseline) is inappropriate for interferometers 
as it does not consider the reliability of the measurements. 
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1 INTRODUCTION 

Interferometers give us a much sharper view than filled 
aperture telescopes of the same area, by sampling the 
sky's spatial frequencies with a set of baselines given by 
the separations between each receiver. Unfortunately the 
distribution of samples ("visibilities") is incomplete, so 
the Fourier transform of the measured visibilities is not 
the sky brightness distribution. The Fourier transform 
of the visibilities instead yields the "dirty map" , which is 
the sky brightness distribution plus noise, convolved with 
the dirty beam (the Fourier transform of the sampling 
pattern). In other terms, a simple Fourier transform pro- 
duces an image with absolutely no power at unsampled 
spatial frequencies. As a result the dirty beam exhibits 
sidelobes causing the fainter structure in the image to 
be buried under the diffraction patterns of the brightest 
objects in the field. 

Unfortunately the most straightforward way of de- 
convolving away the effect of the dirty beam would di- 
vide by zero in the uv plane (the Fourier transform of 
the image plane) wherever no measurement was made, 
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and practical methods must instead attempt to fill un- 
measured regions of the uv plane with a more reason- 
able estimate than zero. Simply interpolating between 
the visibilities does not work in general because alias- 
ing of oscillations in the uv plane can erroneously move 
the emission toward the center of the image plane, and 
distort its appearance. Thus there is no unique prescrip- 
tion for extracting the optimum estimate of the true sky 
brightness, and many methods, most notably CLEAN 
(Hogbom 1974) and maximum entropy deconvolution 
(Gull & Skilling 1983, Cornwell et al. 1999), that vary in 
their properties have been devised. They are usually, if 
not strictly correctly, called deconvolution methods. They 
rely on expected or desired properties of the images such 
as positivity, locality, or smoothness to constrain the im- 
ages they produce. This paper presents a new method, 
called smear fitting, which uses as simple a model as it can 
as its main constraint, and can optionally use additional 
constraints such as positivity and/or confinement of the 
source to a certain region. It renders the measurements 
into images with better fidelity and resolution, and fewer 
image artifacts, than traditional deconvolution methods. 

A great benefit of smear fitting is considerably im- 
proved resolution for objects with peak brightness greater 
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than 4.20 times the root mean square (rms) noise (it does 
not change the resolution of fainter features) without a 
loss of sensitivity from reweighting the data. This feature 
is extremely important since better resolution cannot be 
achieved by simply adding data, and the noise that comes 
with it, from longer baselines or decreasing the weight of 
short baselines, without increasing the root mean square 
(rms) errors in surface brightness over the whole image. 
The objects of interest in typical radio astronomical ob- 
servations span wide ranges of both brightness and size, 
and smear fitting offers a way to optimally handle both. 
Smear fitting also produces a set of components mod- 
elling the source, and calculates the uncertainties of the 
distribution of those components on the sky. The com- 
ponents often correspond to distinct physical features of 
the source(s), making their parameters and associated 
uncertainties of immediate scientific interest. 

Smear fitting has been implemented as a modifi- 
cation (patch) to difmap (Shepherd 1997). The patch, 
known as smerf, is freely available at http://www.drao- 
ofr.hia-iha.nrc-cnrc.gc.ca/~rreid/smerf/. 



2 SMEAR FITTING 
2.1 Procedure 

Smear fitting is a two step process. In the first step a 
set of components, usually elliptical gaussians, with total 
visibility function V mM j e ;(u, v;p) is fitted to the visibilities 
Vi by varying the parameters p to minimize \ 2 '- 

^2 _ Vi — V m,odel{ui,Vi\p) 2 ^ 

where <Ti is the uncertainty of measurement Vi. In the 
second step each component is broadened, or "smeared" , 
until x ls raised by the number of degrees of freedom 
of the component's distribution on the sky. The visibility 
model and residuals are then Fourier transformed and 
summed together to form an image. An example of the 
effect of smearing is shown in Figure 1. 

The goal of this process is a final map that shows 
what viewers intuitively expect when looking at an im- 
age with resolution known to be imperfect: the true im- 
age convolved by the probability distribution of where 
the radiation originates from. The smeared map is ide- 
ally equivalent to the average of an ensemble of maps 
produced from all possible realizations of the noise added 
to the measured visibilities. The first implementation of 
smear fitting used the Monte Carlo method, but the cur- 
rent technique of broadening the components until \ 2 is 
raised by a certain amount is far more efficient. 

In the first step, producing an unsmeared model, it is 
usually impossible to specify a model in its entirety and 
then simultaneously fit all of its parameters. This is be- 
cause typical initial dirty maps are dominated by a small 
number of bright objects that must be modeled and re- 
moved before the underlying structure becomes apparent, 
exactly as with CLEAN. Fortunately components that 
do not appreciably overlap can be independently fitted, 
as can neighboring features with proper downweighting 
of the short baselines (Reid 2003). Therefore the model 



is built incrementally (either manually or by running a 
script) with cycles of: 

(i) Adding to the model one or more elliptical gaus- 
sian(s) for the brightest peak(s), 

(ii) specifying the set of parameters that should be 
fitted (which includes parameters from previously fit- 
ted components that will be affected by the new com- 
ponent (s)), then 

(iii) fitting them to the visibilities by minimizing \ 2 ■ 

A gaussian component fit to an unresolved source 
can approach a 8 function, naively extrapolating power to 
spatial frequencies far beyond those sampled by the mea- 
surements. This makes it necessary to somehow smooth 
the model so that it does not claim a higher resolution 
than the measurements warrant. Smear fitting accom- 
plishes that with its second step, "smearing". During 
smearing, each component of the model is broadened by 
minimizing the sharpness function while constraining \ 2 
to rise to xl nsmearei + A using a Lagrange multiplier. The 
sharpness function is 

B - ~2 y f 2 ( 2 ) 

'"max / > f J c 

where f c and A c respectively are the flux and effective 
area of a component c. The effective area of a gaussian 
is 2nra 2 = 7rr-a 2 /(41n2), where r is its (minor/major) 
axial ratio and a and a are its standard deviation and 
full width at half maximum (FWHM) along its major 
axis. Only relative changes in B matter, so it is convie- 
nient to scale it using Umax, the maximum baseline length 
in wavelengths. That makes the contribution to B > 1 
for components sharper than the spatial frequency cor- 
responding to the longest baseline. Squaring the fluxes 
allows B to be used with negative components as well as 
positive ones. 

The form of B came from devising a function to mea- 
sure the squared amplitude of the model visibilities inte- 
grated over the entire uv plane relative to the squared 
total flux, with a modification to ignore component over- 
laps. The larger B is, the more of the model is in the 
part of the uv plane unsupported by measurements, so it 
is a quantity that should be minimized by reliable mod- 
els. Each component's flux is held fixed during smearing 
to prevent the sharpness decreasing by transferring flux 
between unrelated components. 

A is a positive number limiting the amount of smear- 
ing that is allowed. The larger it is, the more smearing 
there will be, corresponding to a larger confidence region 
of the model parameters. Although model parameters are 
often reported with confidence intervals of ±2, 3, or more 
standard deviations, in the presentation of an image it is 
most natural to plot the image "smeared by one stan- 
dard deviation" in order to recreate the probability dis- 
tribution of the emission on the sky. Another reason to 
only use one standard deviation is that it is customary 
in radio interferometry to publish images with whatever 
is not accounted for by the deconvolution (the residuals) 
added to the result of the deconvolution. Since smearing 
reduces the amplitude of the long baseline model visi- 
bilities (broadening the image plane distribution makes 
the uv plane distribution more compact), it effectively 
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returns information from the model to the residuals. The 
residual map is of course dirty, so the final map will ap- 
pear "dirtier" wherever the effective smearing beam is 
comparable to or larger than the dirty beam. This is most 
likely not the desired effect, so it is important to distin- 
guish performing a reliable deconvolution of a set of data 
from displaying only the information in that data that 
meets a certain level of reliability. The former is what 
smear fitting tries to do, but it can be adapted to the 
latter task by displaying only the model in the final im- 
age, setting A appropriately, and making it clear that 
the resulting image is almost certainly missing some real 
structure that did not "make the cut" . 

Finding A for a given confidence level is accom- 
plished by considering the Monte Carlo view of the pro- 
cess and requiring that the fraction of models with \ 2 no 
greater than Xori 9 + A, after being perturbed by noise and 
refit, matches the confidence level. Assuming gaussian er- 
rors in the data, the model parameters will lie within ±1 
standard deviation of the best fit 68.3% of the time. Typi- 
cally each component is an elliptical gaussian smeared on 
its own, so the relevant degrees of freedom in its distri- 
bution on the sky are its location (2 degrees), major axis, 
axial ratio, and position angle, for a total of 5 degrees of 
freedom. Solving 



0.683 = 



(3) 



with P$(x ) being the probability distribution for \ j 
yields A = 5.89. A for a 2 a confidence interval with 
5 degrees of freedom comes from solving 



0.954 = 



(4) 



and is f f .3. 

To maintain the relation between convolving each 
feature with its uncertainty and minimizing the sharp- 
ness while raising \ 2 by A, it is usually necessary to 
smear each component separately. Otherwise a bright 
sharp component would steal the portion of A that would 
normally go to a fainter, broader, component, even if they 
were unrelated. The only justification for simultaneously 
smearing more than one component is when the compo- 
nents are inseparable parts of a single feature. In such 
(rare) cases some customization is needed, either a mod- 
ification of the sharpness function to fairly distribute the 
smearing, or preferably replacement of the generic ellipti- 
cal gaussian components with a single component of a dif- 
ferent type. Currently implemented alternatives to ellip- 
tical gaussian components are uniformly bright (i.e. opti- 
cally thick) elliptical disks, optically thin ellipsoids (Fig- 
ure 2), and Sunyaev-Zel'dovich clusters (Pearson 1999, p. 
351). 



2.2 The expected amount of smearing 

A rule of thumb for the amount of smearing an isolated 
component will receive can be generated using the fol- 
lowing approximations: 

(i) The uv sampling density is taken to be an ellip- 




Figure 1. Before and after smearing compared to CLEAN 
for a naturally weighted VLA snapshot observation of J0354- 
052 (Reid ct al. 1999). Top left: unsmeared image. Top right: 
smeared image. Note that smearing fattens the knife-edges, 
and that less significant components (shown as ellipses with 
FWHM major and minor axes) are smeared more. Bottom left: 
unsmeared image convolved with the CLEAN beam. Bottom 
right: CLEAN image. The contours start at 0.125 mjy/arcsec 2 
and are each separated by a factor of 2. The solid gray ellipse 
in the lower left corner of each image is the FWHM CLEAN 
beam. 



tical gaussian, specifically the Fourier transform of the 
CLEAN beam. 

(ii) Smearing is restricted to convolving the unsmeared 
component with a scaled version of the CLEAN beam 
with major axis k times as large as the CLEAN beam's 
major axis, ab- 

(iii) The component's unsmeared shape is assumed 
to have the same axial ratio and position angle as the 
CLEAN beam, so that it can be specified by the single 
parameter £ = a c /cib, where a c is the component's major 
axis. 

Then the expected rise in \ 2 due to smearing is 



(Ax 2 



2f 2 W 



1 + 2C 2 1 + 3£ + 2£ 2 



(5) 
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Figure 2. A young planetary nebula, Vy2-2, as it appeared 
at 15 GHz in February 1982. Top left: uniformly weighted 
CLEAN. Top right: The result of automatic smear fitting with 
the modcons script. Bottom left: the result of fitting and then 
simultaneously smearing a positive and a negative gaussian. 
Note the negative intensity in the center of the shell. Bottom 
right: the result of fitting and then simultaneously smearing a 
positive and a negative optically thin ellipsoid (OTE). Vy2-2 
is expected to be an optically thin shell, and it appears to be 
well modeled by the nested OTEs. The grayscale is a linear 
ramp from —1.725 (white) to 3.510 (black) Jy/arcsec 2 , and 
the flat gray ellipses are the FWHM beam extents. 



with £ 



1 + 2C 2 



(6) 



and W being the sum of the visibility weights. 

According to Equation 5, (Ax 2 ) asymptotically ap- 
proaches a maximum of fl W/(1+2Q 2 ) 2 for large smearing 
beams. This means that sufficiently diffuse components 
do not possess enough signal to reach the target (A^ 2 ) 
of A, due to a combination of insufficient flux and the 
fact that as components are broadened they affect fewer 
visibilities. Using the above approximations, the limit at 
which smearing to a level of A cannot be done is 



f c W~ (1 + 2C 2 )A 



(7) 



For a component that is unresolved in the unsmeared 
map, this corresponds to a surface brightness of vA times 
the theoretical r.m.s. noise of the image. If the component 
is significantly resolved, the surface brightness threshold 
approaches V2A times the rms level. In other words, 
there is, unsurprisingly, a minimum surface brightness 
required for the location and shape of a component to be 
determinable with any significance. This holds true even 
when the above approximations are relaxed. 

Thus a low surface brightness component can be 
completely smeared away without sufficiently raising \ 2 . 
In effect, it is completely returned to the residual map, 
and should be removed to simplify the model. If it con- 
tains a significant amount of flux it should be CLEANed, 



since the dirty beam has no total flux unless single dish 
measurements have been included. 



2.3 Instabilities in model construction and 
techniques to overcome them 

Most radio observations have objects in their field that 
are too compact to be resolved with the given data, and 
as x 2 is minimized, there is little to stop the major and 
minor axes of the model components corresponding to 
those objects from collapsing to zero. Presumably all true 
sources have a nonzero size, but the noise, since most of it 
originates in the uv plane instead of on the sky, does not 
necessarily obey the same rules as physical sources. The 
noise amplitude does not decrease with increasing base- 
line length (worse, it gains weight since the visibilities 
become more isolated), so it is quite easy for noise to col- 
lapse a compact component's shape (see the appendix). 
When that happens the best thing to do is to fix the mi- 
nor (and major, if necessary) axes at tiny but nonzero 
values and to move on to other components. This has no 
effect on the final image since the width of the smeared 
component will be dominated by its smearing "beam" . 

The "knife-edge" case of a component with an ex- 
tended major axis but collapsed minor axis is particularly 
common since it does not require a conspiracy of noise 
on both axes, and there are additional ways to produce 
it. The original version of difmap featured an interesting 
shape parameterization that unfortunately allowed the 
fit to enter a domain where the minor axis was imagi- 
nary. Once that happened the minor had to be clamped 
at zero and it became very difficult for the fit to return to 
physical plausibility. The smerf patch to difmap features 
a better shape parameterization (Equations 8 and 9) that 
allows the model fitting routine to try any real value for 
the internal parameters a and 7 while keeping the full 
width half maximum of the major axis, a ma j, and axial 
ratio r (ratio of the minor axis to the major axis) , within 
their physically allowable domains: 



(8) 
(9) 



Umax is the longest baseline length. 

Also, knife-edges can appear where one model com- 
ponent has been used to fit two unresolved features on 
the sky, so that \ 2 minimization joined the features 
with a line, smerf can automatically detect and remove 
such knife-edge components, replace them with pairs of 
smaller components (if they are sufficiently far apart), 
and reminimize % ■ 

All of the above issues occur in the venerable model 
fitting stage of smear fitting, and have fostered an im- 
pression in the community that fitting large sets of el- 
liptical gaussians to interferometer data is not feasible, 
smerf includes several features to improve the robustness 
of model fitting, but truly unresolved sources are ubiqui- 
tous, and smerf's main method of dealing with them is 
smearing. Smearing broadens each component as much 
as possible given the constraints of the data, smooth- 
ing 8 functions and knife-edges as in Figure 1, and at 
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the same time adds to the image a probability distribu- 
tion for the locus of each component. The smeared map 
provides a very intuitive way to judge whether different 
components correspond to significantly distinct features 
on the sky. 



3 COMPARISON TO ESTABLISHED 
METHODS 

3.1 CLEAN 

The CLEAN process (Hogbom 1974 and Cornwell et al. 
1999) is similar to smear fitting except that it uses S func- 
tions for the components and a fixed gaussian for "smear- 
ing" , which is called restoration in the CLEAN nomencla- 
ture. Most of the difference between the two methods in 
the final image arises from the choice of smearing beam, 
but the philosophical difference between the methods is 
in the way they interpolate between visibilities in the uv 
plane. Smear fitting tries to make a gaussian go through 
the error bars of each visibility, but CLEAN fits the flux 
in the central visibilities and tapers off its gaussian ac- 
cording to the sampling density (Figure 4), even though 
the sky distribution is unaffected by the sampling den- 
sity. In other words, CLEAN fails to fit the measurements 
and tends to underestimate the achievable resolution. 

Smearing convolves faint components with a (po- 
tentially much) larger beam than CLEAN. Using Equa- 
tion 5 and its accompanying approximations, the thresh- 
old at which the smearing beam becomes larger than the 
CLEAN beam (Figure 3) is 

~ [(1 + 2C 2 )(3 + 5C 2 + 2C 4 )A] 1/2 /(l + C 2 ) (10) 

J rms 

For a component that is unresolved in the unsmeared 
map, this corresponds to a surface brightness of \/3A 
times the theoretical root mean square noise of the dirty 
image. For unresolved elliptical gaussians smeared by one 
standard deviation, smearing is effectively equivalent to 
CLEAN at approximately \/3A (~ 4.20) times the rms 
surface brightness. If the component is significantly re- 
solved ((" — > oo), the threshold surface brightness ratio 
quickly approaches 2£\/A (~ 4.85^). The smearing beam 
continues to enlarge below that threshhold, becoming in- 
finitely large as the surface brightness of the unsmeared 
feature approaches zero. 

Technically, this means that CLEAN is placing too 
much faith in the sharpness and/or positional accuracy 
of the measurements for components below the surface 
brightness limit in Equation 10, but in imaging terms 
smear fitting's behavior is very similar in the normal case, 
where the residuals are added to the model for the final 
image. When a component is severely smeared, its model 
only accounts for the innermost visibilities, and leaves the 
outer baselines in their original form. In other words, the 
more a component is smeared, the more the dirty beam 
shows through in the final image, and the appearance of 
a radically smeared component in the final image is not 
really broadened any more than it would be by CLEAN. 
The size of the smearing beam for each component can 
still be seen however, in a textual listing of the smeared 
model. 



1 1 r~i 1 1 1 1 i 1 1 1 1 i 1 1 1 1 i 1 1 1 1 r 
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Figure 3. The surface brightness limit (relative to the theo- 
retical root mean square noise) in the dirty map at which the 
smearing beam (one standard deviation) would be same size 
as the CLEAN beam for an elliptical gaussian, as a function of 
£, the ratio of the unsmeared component major axis a c relative 
to the CLEAN beam major axis aj,. 



3.1.1 Total Flux Density and Source Counts 

Although there is little cosmetic benefit to adding fea- 
tures below the limit of Equation 10 to the model, it 
can be useful for both CLEAN and smear fitting when 
the total flux density in the field is of interest, but no 
zero baseline (i.e. single antenna) measurement is readily 
available. In that common case, the theoretical total flux 
density in the dirty beam, and thus the residual map, is 
zero. Practically, the positive part of the dirty beam is 
concentrated in the central lobe and the negative part is 
spread over the rest of the Fourier transformed region, so 
a sum of the residuals over a small area will have nonzero 
flux density. A more accurate estimate of the total flux 
density can be obtained by modelling the trend of the 
short baseline visibilities using a deconvolution method 
and including diffuse structure that would not otherwise 
require deconvolution. Both CLEAN and smear fitting 
can include arbitrarily faint components in their models 
although systematic effects can make those components 
unreliable. 

As with CLEAN, a source count will of course be in- 
complete if model construction is stopped early and only 
the component list (instead of the image) is used for the 
count. Similarly to flux estimation, either the model con- 
struction stage of smear fitting can be continued to the re- 
quired depth, or a different deconvolution method may be 
used for the fainter emission. More interestingly, a single 
smear fitted component can correspond to many CLEAN 
components, so a list of smear fitted components should 
provide a more accurate raw count than a list of CLEAN 
components to the same depth. More accuracy, however, 
does not imply absolute accuracy. Typically, but not al- 
ways, smear fitting allocates one component for each re- 



© 2006 RAS, MNRAS 000, 1-16 



6 R. I. Reid 



solved feature, and the counter still needs to decide which 
components should be coalesced into "sources". 

3.1.2 Visibility weighting 

Since CLEAN depends so heavily on the sampling pat- 
tern, it is customary to weight the outer visibilities more 
heavily than the inner ones, sacrificing some sensitivity 
(especially to diffuse objects) for an improvement in res- 
olution (Briggs 1995). The basic idea of a common proce- 
dure, called uniform weighting, is to weight each visibility 
by the reciprocal of the number of visibilities within a cer- 
tain distance of it. Since long baselines tend to be more 
isolated than short ones, uniform weighting - or anything 
more extreme, called superuniform weighting - produces 
more compact central lobes in the dirty beam, but the 
rms noise in the surface brightness is degraded by the 
partial removal of short baseline data and the decrease 
in beam area (the rms noise in the surface brightness is 
inversely proportional to the product of the beam area 
and the square root of the number of measurements). 

Smear fitting does not require any reweighting in its 
final images, and dynamically allocates the outer visibili- 
ties as much control as they deserve based on their signal 
to noise ratios, so it achieves optimum sensitivity without 
sacrificing any resolving power. 

3.1.3 Fixed vs. variable resolution 

Smear fitting can, unlike CLEAN, adapt to scales smaller 
than the dirty beam without losing sensitivity to diffuse 
structure (Figures 6, 10, and 11). In fact the most strik- 
ing difference between smear fitted images and CLEAN 
images is that smear fitted images usually include one or 
more peaks that are much sharper than the dirty beam. It 
has been a long held precept that the smallest resolvable 
angle, min , is given by 8 min = 1.2A/D, where D is the 
effective diameter of the telescope. That equation comes 
from the Rayleigh criterion for a circular aperture (Born 
& Wolf 1999), which states that two equally bright point 
sources are indistinguishable from a single source if their 
separation is less than the distance between the diffrac- 
tion pattern's central peak and its first minimum, since 
at that distance most diffraction pattern convolved fea- 
tures appear to meld together. Deconvolution methods 
that reveal details finer than the diffraction pattern are 
thus often called "superresolving" , but since the purpose 
of deconvolution is to remove the effect of the diffraction 
pattern, the Rayleigh criterion is not the right one to use 
as a standard for resolution. 

In general the achievable resolving power of deconvo- 
lution is limited by how well the effect of the diffraction 
pattern can be removed, which is determined by both the 
size of the diffraction pattern's main lobe and the signal 
to noise ratios of the features to be resolved. The blended 
beams of very close features are difficult to disentangle 
unless the data have a high signal to noise ratio, and con- 
versely distant features are not reliably separated if one 
or both are so faint that the probability distributions of 
their locations significantly overlap. 

As illustrated in Figure 4, the resolution of the 
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Figure 4. The response in the uv plane of smear fitting, 
CLEAN, and maximum entropy to a simulated snapshot ob- 
servation (error bars) of a 10 mjy point source (horizontal 
gray line) by a 7 antenna VLA subarray. Maximum entropy 
did not perform well with this extremely sparse set of data, 
and unlike the other deconvolutions did not have the beneficial 
constraint of fitting a very simple model to this very simple 
source. Also, its prior (a flat image) was completely inappro- 
priate for a point source, so this should not be considered to 
be a typical case for maximum entropy deconvolution. 

CLEAN beam (and the Rayleigh criterion) is set by the 
average spatial frequency of the measurements regardless 
of their values. That is exactly what is needed to emulate 
a nondeconvolved image from a filled aperture telescope 
of the same size, since with filled aperture instruments 
there is one detector per direction (for example, a pixel 
in a CCD) that simply receives the sum of all the spa- 
tial frequencies sampled by the dish. Interferometers have 
separate measurements of the visibility function at the 
sampled spatial frequencies, which the model fitting step 
of smear fitting uses to discern the trend of the data. The 
smearing step biases the model within the framework of 
its parameters to be as broad as possible without devi- 
ating too far from the data. Figure 5 demonstrates that 
CLEAN is able to detect sub-beamwidth structure which 
is obliterated by the restoring beam in the final image. 
Smear fitting is able to display the sub-beamwidth struc- 
ture, but without the often erroneous discreteness of the 
CLEAN component distribution. 

3.1.4 Multiple scale CLEAN 

"Multiresolution Cleaning" (Wakker & Schwarz 1991) 
is an extension to CLEAN that aims to improve upon 
CLEAN's handling of extended emission (e.g. Figure 6) 
by using not just one CLEAN beam, but a set of CLEAN 
beams of different scales. In practice two variations, 
multi-scale CLEAN (Cornwcll & Holdaway 2006) and 
adaptive scale pixel (ASP) decomposition (Bhatnagar & 
Cornwell 2004) work better and differ in how they use and 
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Figure 5. Top left: naturally weighted CLEAN image of a 
B configuration VLA snapshot (Reid et al. 1999). The lowest 
contour is at 31.25 /xjy/arcsec 2 . Top right: uniformly weighted 
CLEAN image of the same snapshot. The CLEAN components 
are shown as crosses, and hint that there are two jets and a 
core hidden by the CLEAN beams of the lobes. Bottom left: 
smear fitted image of the same data. The contours start at 
31.25 /^Jy/arcsec 2 . Bottom right: CLEAN image made with 
the addition of VLA A configuration data, confirming the re- 
ality of the structure suggested by the B configuration CLEAN 
components and made visible by the smear fitted map. Note 
that some of the apparent discrepancy between the bottom 
images comes from the B array emphasizing emission from 
larger spatial scales than the A array. The lowest contour is at 
1 mjy/arcsec 2 . In all of the images each contour is separated 
by a factor of 2 from its neighbor(s), and the gray disks show 
the FWHM of the dirty beams. 
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Figure 6. Deconvolved images of a simulated VLA snap- 
shot of a 0.1 Jy, 0.1" FWHM, spike (central dot) on a 0.4 
Jy, 5" FWHM, circular gaussian. Top: CLEAN. The residu- 
als are excessively large and reflect the structure of the dirty 
beam convolved with the broad feature. Bottom: smear fitting. 
The residuals are essentially uniformly distributed noise. The 
model components for both images were automatically placed 
by a script without knowledge of the true source's structure. 
The solid gray ellipse is the FWHM extent of the CLEAN 
beam. The contours start at 0.25 mjy/arcsec 2 and are each 
separated by a factor of 2. Both methods used natural weight- 
ing. 

set the variously scaled beams. For structures larger than 
the standard CLEAN beam their behavior should fall 
somewhere between smear fitting and standard CLEAN 
if natural weighting is used, with somewhat reduced flex- 
ibility because of the finite number of choices. Practi- 
tioners can however in principle preload the algorithms 
with elongated gaussians if needed, and ASP dynami- 
cally updates its set of scales. Structures smaller than the 
standard CLEAN beam can be resolved if sharp enough 
beams are used but such resolution (as in the bottom of 
Figure 7) is not reliable since CLEAN and its derivatives 
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do not have a signal to noise based mechanism for testing 
resolution. Much more effort has been put into their abil- 
ity to recover structures the size of the standard CLEAN 
beam and larger. 

3.1.5 Self-calibration 

Smearing assumes that the measurement errors are nor- 
mally distributed noise, but the quality of many data sets 
is limited by calibration errors, which must be corrected 
by selfcalibration Cornwell & Wilkinson (1981). Selfcal- 
ibration assumes (but is constrained in its action when 
the number of receivers is greater than four) that the 
model includes most of the source and that the residu- 
als are mostly due to calibration errors. Amplitude self- 
calibration in particular can be dangerous with classi- 
cal CLEAN since traditional CLEAN picks the brightest 
peaks first, and leaves behind any real structure that is 
fainter than the calibration error induced artifacts around 
bright peaks. This results in a tendency of amplitude 
selfcalibration to distort the gains of central antennas 
if applied carelessly. Smear fitting and multiresolution 
CLEAN are both able to fit broad features at an early 
stage, making it possible in most cases to apply selfcali- 
bration before calibration errors threaten to contaminate 
their models and prevent convergence on the correct so- 
lution. 



3.1.6 Comparisons using known brightness 
distributions 

Figures 8 to 13 present the responses of CLEAN and 
smear fitting to a fairly complex source under a vari- 
ety of conditions. In order that the true image could be 
known, and to dispense with the effects of calibration 
errors, Miriad's (Sault ct al. 1995) uvgen command was 
used to simulate VLA observations (with noise) of a given 
model. To make the input model representative of a real 
observation, it includes the northwest jet and core of a 
smear fitted model made with the combination of several 
VLA A and B array snapshots of J012435-040105 (Reid 
ct al. 1999). A smear fitted model was used because it 
was the best source of both very sharp and broad real- 
istic structure available. The morphology of this source 
is also very challenging since it contains a sharp feature 
of moderate flux density embedded in a broader stream, 
and the jet bends over on itself into a crook. The over- 
all envelope of the jet was enhanced by the addition of a 
broad elliptical gaussian. To prevent the model from be- 
ing completely composed of smear fitting's stock in trade, 
several dozen CLEAN components (restored with the A 
array beam), both positive and negative, were randomly 
scattered around the jet. Note that the CLEAN compo- 
nent positions are completely random and not centered 
on pixels. 

In the simulated B array observations (Figures 9 to 
12) the challenge was to resolve the three sharp peaks 
near the core and as much of the jet morphology as pos- 
sible. Smear fitting did much better, even when uniform 
weighting was used for CLEAN. Neither of the algorithms 
managed to resolve the two peaks closest to the core. 




I 



i 




Figure 7. Deconvolutions of a section of the Canadian Galac- 
tic Plane Survey (Taylor et al. 2003) including many AGN and 
the less sharp galactic HII region Sharpless 155. Top: smear 
fitted. Middle: multiscale CLEANed and restored with a gaus- 
sian fit to the uniformly weighted dirty beam. Bottom: mul- 
tiscale CLEANed without convolution by a CLEAN beam. 
Convolution by the standard CLEAN beam obscures too much 
detail, but doing without it leaves the compact sources as <5 
functions embedded in the next larger gaussian used by mul- 
tiscale CLEAN. A square root function has been used for the 
grayscale intensity mapping. 
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Figure 8. The reference image "observed" by the simulations, 
based on a smear fitted model of a 4.7 GHz VLA A and B ar- 
ray observation of the northern two— thirds of J012435-040105, 
with some faint (restored) CLEAN components, both positive 
and negative, included. The contours start at 20 /ijy/arcsec 2 
and each is separated by a factor of 2 from its neighbor (s). 




Figure 9. Image, using the true model, of a simulated 4.7 GHz 
VLA B array observation of Figure 8 placed at a declination 
of +30°. The contours start at 50 ^tjy/arcsec 2 and each is 
separated by a factor of 2 from its neighbor(s). The solid gray 
ellipse in the lower left corner shows the FWHM extent of the 
dirty beam. 

The smear fitted image of the extended observation (Fig- 
ure 12) did very well otherwise, while the CLEAN image 
was only able to improve its sensitivity, not its resolution. 
In theory the resolution of a smear fitted image can im- 
prove without bound as data of the same average baseline 
length is added, but in practice the improvement would 
eventually be limited by calibration errors. 
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Figure 10. A smear fitted image made using the same data, 
contours, and beam as Figure 9. 



With the simulated VLA A array observation, Fig- 
ure 13, of Figure 8, smear fitting leaves the diffuse emis- 
sion mainly to the imagination of the viewer, although 
it could be argued that there is a statistically significant 
enhancement of the density of small peaks in the region 
of the diffuse emission. Away from the emission region 
the residuals are satisfyingly noise-like, but ironically the 
broad negative artifacts to the north and south of the jet 
in the CLEAN image announce to the connoisseur the 
possible existence of badly imaged diffuse emission. 



3.2 Maximum Entropy 

Smear fitting and CLEAN look for structure in the im- 
age plane and then impose smoothness based on uv 
plane considerations, but maximum entropy deconvolu- 
tion (Cornwell et al. 1999, Cornwell & Evans 1985), or 
ME, takes a different approach by starting with a (usu- 
ally) smooth default image (the prior) and trying to 
maintain smoothness in the image plane as structure is 
imposed by the visibilities. Otherwise it has the same goal 
as smear fitting: to produce the smoothest image possible 
within the constraints of the data. In ME deconvolution 
each pixel is considered to be a variable, but smear fitting 
can be considered as a type of ME where the variables are 
a set of model parameters. This is not to say that smear 
fitting with an elliptical gaussian for each pixel would be 
equivalent to traditional ME. First, each gaussian has 6 
degrees of freedom, while a pixel only has one. More im- 
portantly, in traditional ME it is the distribution of pixel 
intensities that is compared to the measurements, while 
smear fitting calculates \ 2 f° r the se t of model parame- 
ters. 

The choice of definition for the entropy function, TL, 
to be maximized, is controversial (Narayan & Nityananda 
1984) but the most popular one is 
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Figure 11. CLEAN dcconvolutions of the same simulated ob- 
servation used in Figure 9. Top: naturally weighted image with 
the same contours as Figure 9. Bottom: uniformly weighted 
image with contours starting at 0.2 mjy/arcsec 2 , each sep- 
arated from its neighbor(s) by a factor of 2. The solid gray 
ellipses in the lower left corners shows the FWHM extent of 
the dirty beam. 

k 

where Ik and Mk respectively are the intensities of the 
fcth image and prior pixels. Equation 11 can only be used 
for positive images, but other functions which are be- 
tween linear and quadratic, like 

H = -X>(cosh^-^) (12) 

where here a is the noise level in the image, also provide 
smoothing, and have been used for polarization images. 
All of these functions, however, only consider the sum of 
functions of individual pixels. This is hazardous since the 




Figure 12. Dcconvolutions made from a simulated 8 hour 
VLA B array observation of Figure 8 placed at a declination 
of +30° . Top: smear fitted image with contours starting at 10 
/^Jy/arcsec 2 . Bottom: uniformly weighted CLEAN image with 
contours starting at 50 /xjy/arcsec 2 . Each contour is separated 
by a factor of 2 from its ncighbor(s), and the solid gray ellipses 
in the lower left corner shows the FWHM extent of the dirty 
beams. 



data, being in the uv plane, are more directly connected 
with the relationships between neighboring pixels than 
the values of individual pixels. In particular, if the source 
has a sharp peak on top of a diffuse background, the 
background dilutes the smoothing power that 7i would 
get from its steepness at very low intensities, and the 
peak is left partly dirty (Figure 14). 

Another criticism of traditional ME is that the de- 
fault prior image, a flat distribution, is actually a very 
unlikely state for an interferometric image (unless noth- 
ing has been observed!). ME is thus biased toward putting 
flux in pixels that should not have any, as in Figure 4. 
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Figure 13. Deconvolutions made from a simulated VLA A ar- 
ray snapshot of Figure 8 placed at a declination of +30° . Top: 
smear fitted image with contours starting at 0.5 mjy/arcsec 2 . 
Bottom: naturally weighted CLEAN image with contours 
starting at 0.8 mjy/arcsec 2 . Each contour is separated by a 
factor of 2 from its neighbor(s), and the solid gray ellipses 
in the lower left corner shows the FWHM extent of the dirty 
beams. 



Smear fitting, like CLEAN, is more accepting of negative 
pixels, although both often use positivity as a constraint 
when constructing their models. 

Smear fitting and ME both produce final results that 
are biased away from the best fit to the measurements, 
but they have their justifications. ME has two, the first 
being the importance of not claiming anything that is 
not absolutely necessary, or in other words maximizing 
the entropy. The second justification is simply that ME 
produces better images with the bias than without it 
(simulatable by making the uncertainties approach zero). 
Smear fitting adds the goal of trying to estimate the prob- 
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Figure 14. Ringing in a maximum entropy image, made using 
the AIPS task VTESS, of the simulated data used in Figure 6. 
The contours start at 40 mjy/arcsec 2 and are each separated 
by a factor of 2. 
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Figure 15. Comparison of maximum entropy and CLEAN 
to smear fitting, for a low elevation observation of J125720- 
333450 (Reid et al. 1999). The contours start at ± 0.032 mjy 
per square arcsecond, and increase by a factor of two between 
successive contours. Each map used the same dataset, self- 
calibrated using smear fitting. The bottom left graph plots 
the intensity of each pixel in the smear fitted (vertical axis) 
image against that of the same pixel in the maximum entropy 
(horizontal axis) image. 



ability distribution of where the measured light may have 
originated from. Seen this way, it is not biased at all, if 
the uncertainty determination is correct. 

Figure 15 illustrates the similarity of maximum en- 
tropy to smear fitting, although it may not look like it 
at first glance since the largest and most distracting con- 
tours are the faintest and least reliable. It was not feasible 
to deconvolve down to the noise, or even the rms level of 
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the smear fitting residuals, with maximum entropy, with 
any of the large sample of VLA snapshots of jets in Rcid 
ct al. (1999). That is partly due to the hot spots that all 
jets have. Normally they would be CLEANed away be- 
fore applying maximum entropy, but as the CLEAN im- 
age shows, convolving the three brightest points with the 
CLEAN beam would have hidden much of the structure. 
Also, it could be that pixel-by-pixel flexibility of maxi- 
mum entropy leaves it more vulnerable to the sampling 
sparseness of the snapshot survey, which could effectively 
act as an additional apparent noise source. 

Nevertheless, when the lowest contours are ignored, 
the ME and smear fitted maps are quite similar, and 
both show an effective beam for bright emission that 
is much smaller and less elongated than CLEAN 's. The 
lower left corner of Figure 15 quantitatively compares 
them by graphing the intensity of each smear fitted pixel 
against the intensity of the corresponding pixel in the 
ME map. If the images were identical, the locus of points 
would be a straight line with slope 1. Obviously the range 
of maximum entropy pixel values is only about a quar- 
ter of that in the smear fitted image. This cannot be 
fully accounted for by the maximum entropy image be- 
ing "dirtier" , since that mainly affects the faintest pixels. 
This may be due to maximum entropy treating entropy 
as a global property of the image, while smear fitting 
treats entropy (smearing, roughly) as a local property 
of each component. Smear fitting thus prevented from 
wrongly transferring flux between separated features in 
order to lower the overall sharpness of the image. Indeed, 
the sharpest components tend to have the most signal 
and are smeared least. 

Smear fitting and ME match more closely at medium 
intensities, as shown by Figure 16. CLEAN (Figure 17), 
however, does not approach a one-for-one match with 
smear fitting except at very low intensities, where ME 
also has a short dense locus of points with overall slope 
1 (the inner 15 (j,Jy of Figure 16). Those pixels are in 
the northwest lobe, where because it is broad and flat 
the pixel values are not strongly affected by convolution 
with any beam the size of the CLEAN beam or smaller. 



3.3 Pixel Based Models 

All existing deconvolution methods produce a rectangu- 
lar array of pixel values as their final image, but there is 
no fundamental reason why their internal representation 
of the source should also be a rectangular array of pixel 
values. Although the desired output, an image capable 
of being displayed on common computer hardware, is a 
set of pixels, the input measurements are an irregular ar- 
ray of visibilities, and operations which need to consider 
measurement uncertainty tend to be more practical when 
performed in the same basis as the measurements. One 
of the motivations of smear fitting was to bring imaging 
conceptually closer to the measurements, and as a result 
the models used in smear fitting have no explicit depen- 
dence on pixel size or shape. 

More prosaically, the locations of the component cen- 
ters in smear fitting are not quantized to land on pixel 
centers. In many algorithms, including ME and non- 
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Figure 16. The fainter pixel intensities of the smear fitted 
(vertical axis) and ME (horizontal axis) images of Figure 15. 
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Figure 17. The pixel intensities of the smear fitted (vertical 
axis) and CLEAN (horizontal axis) images of Figure 15. Note 
that the dynamic range of the CLEAN axis is much smaller 
than that of the smear fitted axis. 



negative least squares (NNLS, Briggs (1995)) deconvolu- 
tion, the model is a set of pixels, and in commonly avail- 
able implementations of CLEAN the model components 
are only placed on pixel centers (although in principle 
they do not need to be). Pixels poorly represent features 
that are centered on pixel edges, which can degrade the 
usefulness of pixel based models for selfcalibration. Per- 
ley (1999) has also shown that a sharp feature centered 
on a pixel edge cannot be represented by pixel centered 
CLEAN components without violating the positivity con- 
straint. 

The Pixon deconvolution method (Puctter & Yahil 
1999) shares with smear fitting the concept that the 
model (a preliminary image in Pixon's case) should be 
smoothed as much as possible within the constraints of 



© 2006 RAS, MNRAS 000, 1-16 



Smear fitting 13 



the data, and that the amount of smoothing should be 
determined locally, not from a global prior as with ME. 
Instead of smear fitting's procedure of fitting a model to 
the measurements by minimizing x 2 an d then smoothing 
that model, the Pixon method convolves each pixel in a 
model image with the largest acceptable kernel in a list 
of typically a dozen gaussians. The Pixon technique thus 
uses a pair of variables (flux and kernel size) for each 
pixel, while smear fitting eschews pixels and attempts to 
model the source with the minimum number of parame- 
ters, typically several dozen, required by the visibilities. 

X 2 minimization adjusts the model to maximize the 
probability of the data given the model, so smear fitting 
could be broadly classified as a smoothed maximum a 
posteriori probability (MAP) algorithm, while the Pixon 
method results in a similarly smoothed NNLS image. 
As a NNLS variant, the Pixon method cannot represent 
sources with both positive and negative emission. Bhat- 
nagar & Cornwell (2004) also note that the Pixon method 
is not suited to interferometry, since the algorithm relies 
on a compact point source response function (i.e. a dirty 
beam which is zero outside a finite region) , and noise that 
is independent and additive in the image plane. 



3.4 Processing speed 

Smear fitting tends to be somewhat slower than CLEAN, 
but not prohibitively so, and most observations can be 
smear fitted using standard hardware in a reasonable 
length of time. Quantifying the speed of smear fitting 
relative to CLEAN or Maximum Entropy can be done 
for a few examples, but extrapolating from those exam- 
ples to all observations is nearly impossible, because the 
speed of smear fitting depends most strongly on the ob- 
servation's "complexity", which is itself difficult to mea- 
sure. The complexity increases with the number of com- 
ponents, but also depends on how much coupling there 
is between the components and how well the observation 
can be modeled with the given set of basis functions. To 
give some scale to "not prohibitively", using a 1.6 GHz 
Athlon CPU, CLEAN took one minute to produce the 
naturally weighted image of Figure 11 while smear fit- 
ting took twenty minutes to deconvolve the same simu- 
lated VLA snapshot (Figure 10). Neither algorithm was 
hampered by a lack of RAM or the need to self-calibrate. 

To some extent the speed of other deconvolution 
methods is also affected by source complexity, but to first 
order the processing time required by ME is set by the 
number of pixels, while CLEAN's depends on the num- 
ber of pixels with significant brightness, being faster than 
Maximum Entropy for typical VLA images with up to 10 6 
active pixels (Cornwell et al. 1999). 

Fitting a gaussian is more time consuming than plac- 
ing a CLEAN component, but one gaussian, or even bet- 
ter a specially selected function, in smear fitting corre- 
sponds to many CLEAN components. Typically CLEAN 
images have thousands of components, while smear fit- 
ting only uses a few dozen, and the flux of each CLEAN 
component is typically built up with 5 to 100 steps (a gain 
of 0.2 to 0.01). Smear fitting can be the fastest method 
for simple but extended sources and/or a small number 



(< 50000) of visibilities. Note however, that although the 
runtime of a single iteration of \ 2 minimization is propor- 
tional to the number of visibilities, improving the dirty 
beam with more uv samples can make component speci- 
fication more efficient and reduce the number of \ 2 min- 
imization iterations needed. 

The number of visibilities can be effectively lowered 
without drastically degrading the dirty beam by binning 
them. Normally binning the visibilities should be avoided 
since it can create problems (Briggs 1995) and is not re- 
quired when smear fitting 1 . Fortunately large data sets 
that are enough to make smear fitting annoyingly slow 
also tend to have dense uv coverage, eliminating many 
of the ambiguities that can impede model construction, 
and making them good candidates for binning. More im- 
portantly, smear fitting does not use uniform weighting, 
which is responsible for most of the imaging problems 
with binning. 



4 DISCUSSION 

Deconvolution by fitting simple models to the visibili- 
ties using x 2 minimization is by no means new. In fact 
it predates CLEAN (Hogbom 2003) but garnered a rep- 
utation of being difficult and unreliable. The most se- 
rious problem with traditional model fitting, at least for 
imaging, is that components corresponding to unresolved 
features can, and probably will, collapse into Dirac S dis- 
tributions or knife-edges. Smearing explicitly does away 
with that problem by convolving each component with 
an elliptical gaussian set by the uncertainty of the com- 
ponent's shape and location. The other commonly heard 
complaint traditional model fitting is that a model must 
be supplied before its parameters can be fit to the data, 
and in the typical case of incomplete data it is impossible 
to be sure that the model both has the right variables and 
started in the correct (i.e. global) local minimum of x 2 ■ 
In other words, different astronomers can derive different 
results from the same data because they started with dif- 
ferent initial models based on their subjective choices. Of 
course, that situation is not unique to imaging, but ob- 
jectivity is still worth striving for. Smear fitting removes 
some of the subjectivity in the final image by smearing 
the statistically insignificant details, but more pertinently 
its implementation as a patch to difmap promotes auto- 
matic model construction. Practitioners of smear fitting 
should only rarely need to intervene in the model con- 
struction process, for example by choosing to model a 
feature with an alternative functional form to an ellipti- 
cal gaussian, and in such cases should be able to support 
their choice based on an improvement in x 2 or positivity, 
or data from other wavebands. 

Smear fitting may appear to smear more than 
CLEAN for low surface brightness objects, but it must be 
remembered that whatever is smeared out of the model 

1 Technically binning and gridding the visibilities is necessary 
for the FFT of the residuals in the final map, but any bright 
features should first be moved out of the residuals and into 
the model. 
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is returned to the dirty residuals. Usually the CLEAN 
beam size is matched to the dirty beam, so smear fitting 
does not produce worse resolution than CLEAN 2 . Unfor- 
tunately the image is not the definitive place to determine 
whether sub-beam features are extended. To properly an- 
swer that question the reverse of smear fitting should be 
done; collapse the component (s) down to a single Dirac 
5 distribution and check whether x 2 is raised above the 
minimum by at least twice the number of parameters 
specifying the original shape. The smerf patch does not 
provide a command to automatically perform this check, 
but it is easy enough to do on a case- by-case basis. These 
properties of variable resolution also apply to ME if it is 
not convolved with a CLEAN beam at the end(Cornwell 
et al. 1999), although the test for resolution, by minimiz- 
ing the entropy, would in general not be as useful since 
ME works on an entire image at a time instead of specific 
features. 



4.1 Sensitivity and weighting 

Although smear fitting cannot deconvolve low surface 
brightness features any better than the other methods, 
its use is beneficial to being able to detect them. Smear 
fitting uses the most sensitive weighting scheme, natural 
weighting, while the other methods often downweight the 
inner visibilities to produce a sharper and more gaussian 
dirty beam. Robust weighting (Briggs 1995) is a consid- 
erable improvement over uniform weighting, but natural 
weighting still gives the greatest surface brightness sensi- 
tivity. All deconvolution methods can use natural weight- 
ing, but it produces a large beam, and no fixed resolution 
method can distinguish faint emission from bright peaks 
when they are within a beamwidth of each other. 

Uniform or superuniform weighting hurts the sensi- 
tivity to sharp as well as diffuse objects by effectively ig- 
noring central visibilities even though they measure the 
flux of sharp features just as well as the long baseline 
visibilities (ignoring possible differences in antenna sen- 
sitivities). Smear fitting uses all visibilities for detection 
of flux, so that the position of a visibility simply affects 
its leverage on the resolution. 

More subtly, smear fitting avoids the most common 
errors in fitting the bright features that typically limit the 
dynamic range of images deconvolved using other meth- 
ods. The first two of these errors are due to premature 
pixelization in order to use the Fast Fourier Transform 
(FFT). The FFT requires the visibilities to be placed in 
rectangular bins, and outputs the result in rectangular 
pixels. This introduces firstly a quantization error to the 
positions of the visibilities. Secondly, if anything but nat- 
ural weighting is used, it can have disastrous effects if a 
visibility is placed in a bin by itself when most visibilities 
share their bin with several others. Uniform weighting 
would give the lone visibility as much weight as a large 
batch of visibilities in a nearby bin, thereby amplifying 
the error of the lone visibility (Briggs 1995). Smear fitting 

2 The smear fitting analog of using an artifically small 
CLEAN beam would be to smear with A < 5.89, i.e. convolve 
by < 1 standard deviation. 
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Table 1. Smearing beam dependence on source declination 
for simulated VLA A Array snapshots on the meridian. Note 
that Ipe a k is the surface brightness in the dirty map. 



avoids binning by using analytical Fourier transforms for 
its model components. The final image is displayed us- 
ing pixels, but there is no error generated in the Fourier 
transformation of the model. The residuals still use the 
FFT, but typically in the final image their dynamic range 
is so low that the introduced error is negligable. 



4.2 Beam elongation 

A notable advantage of smear fitting is that it is less 
affected by the source elevation angle than CLEAN (Fig- 
ure 18). The axial ratio of an east-west interferometer's 
dirty beam is the absolute sine of the source's declination, 
meaning that the major axis approaches infinity for ob- 
jects near the celestial equator. Similarly, an array with 
both east-west and north-south baselines has foreshort- 
ened uv coverage for low elevation sources, so many ob- 
servations have strongly elliptical dirty beams that dis- 
tort the appearance of their features. CLEAN can mask 
that distortion by using a round restoring beam with the 
same radius as the semimajor axis of the dirty beam, 
but at the cost of losing resolution along the minor axis. 
Smear fitting copes better, since the density of baselines 
goes up when they are compressed along one axis. Thus 
a component that is resolved along that axis will have 
more visibilities brought in to where it needs them, so 
its signal to noise ratio will improve and it will not be 
smeared as much (see Table 1). Less resolved features 
benefit less, since for them the effect on x 2 (i.e. resis- 
tance to smearing) of a visibility is proportional to the 
fourth power of the baseline length, so the outer base- 
lines dominate (Appendix A). If the unresolved features 
are bright their major axes will be strongly affected by 
the baseline foreshortening, but if they are faint enough 
for the Fourier transform of their smearing beams to be 
enclosed by the envelope of where there are visibilities, 
they also experience the rounding effect. 
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Figure 18. Deconvolved images of a simulated VLA B array 
snapshot of Figure 8 at a declination of —30°. Top: CLEAN. 
Bottom: smear fitted. The solid gray ellipses are the half max- 
imum extent of the CLEAN beam (same in both) and the 
contours start at 25 /ijy/arcsec 2 , with each separated by a 
factor of 2 in brightness. 

4.3 Customized component types 

Elliptical gaussians are convenient basis functions for 
modelling most sources, but some objects, especially ones 
that have steep edges and known physical forms, are bet- 
ter fit if different functions are introduced. Figure 2 shows 
four deconvolutions of a VLA snapshot of a planetary 
nebula, Vy2-2 (Christianto & Seaquist 1998). Vy2-2 is 
young and well described at 15 GHz by an optically thin 
shell. The shell was only moderately resolved by CLEAN, 
and smear fitting with elliptical gaussians either broke the 
symmetry of the source or produced a unphysically nega- 
tive center for the shell. The ring could be better approxi- 
mated by using more gaussians, but splitting the the flux 
into smaller portions would enlarge the smearing beam 



for each component. Ideally smear fitting should use as 
few parameters as necessary, and the right framework (or 
model type) for those parameters. The difmap and the 
smerf patch do not explicitly have optically thin shells 
as a component type (although optically thick ones are 
available as flat disks), but one can be easily constructed 
by placing a negative optically thin ellipsoid (OTE) inside 
a positive one. Since the pair correspond to a single fea- 
ture, and especially since it is the difference of their flux 
densities, not their individual flux densities, that is phys- 
ically meaningful, smearing gaussians were calculated for 
them simultaneously instead of serially. The result was 
a much better fit, and the possibility of measuring the 
geometrical thickness of the shell. 

4.4 Future possibilities 

Model fitting by minimizing \ 2 is extremely flexible, and 
could be extended beyond providing a choice of compo- 
nent types. One avenue for future work would be to com- 
bine data from multiple polarizations and/or frequencies 
while applying constraints (for example Q 2 + U 2 + V 2 < 
I 2 and/or a spectral index) on how the components 
should appear in each subset of the data. Although such 
dynamic constraints have not yet been used with smear 
fitting, it is already possible to construct a model using 
one set of data and use it as the initial model with related 
data sets. For example, the positions of a set of isolated 
unresolved objects are the same for all polarizations and 
frequencies, so a model of the set can be produced and 
smeared in Stokes I, and then the Q and U models can be 
obtained by simply refitting the fluxes without needing 
to change the smears. 



5 CONCLUSION 

Smear fitting is an image deconvolution method that fits 
a near maximally simple model to interferometer visi- 
bilities and then broadens the model to account for the 
uncertainty of those visibilities. The model construction 
method avoids several problems that can limit the qual- 
ity of CLEAN and maximum entropy deconvolutions, and 
smearing generally yields sharper and fairer images than 
CLEAN. 

As mentioned above, smear fitting has been im- 
plemented as a modification (patch) to difmap, a 
well known program for imaging and selfcalibrating 
data from radio interferometers. The patch, known 
as smerf, is freely available under the GNU license 
(Stallman 1991) at http://www.drao-ofr.hia-iha.nrc- 
cnrc.gc.ca/~rreid/smerf/, and includes a manual on its 
use. 
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APPENDIX A: BASELINE LEVERAGE 

Consider a feature modeled by a circular gaussian with 
flux / and FWHM a. We can without loss of generality 
place the phase tracking center at its location, so that the 
model visibilities V m ,i are /exp (— 0.36a 2 u 2 ), where Ui is 
the length of baseline i. Writing the differences between 
the measured and model visibilities as e^, 

Xunsmeared = ^ (Al) 
i 

Smearing the component with a round gaussian with 
FWHM a s multiplies the model visibilities by /3(a 2 « 2 ) = 
exp (— 0.36a 2 u 2 ) . (This discussion can easily be extended 
to an elliptical smearing beams, but it is not warranted 
here.) The expected rise in \ 2 due to smearing is 

smeared Xunsmeared 

(A2) 

= ^-^(i-^^.'K 1 -^^-.^ 2 ^] ( A3 ) 

The leading term in a Maclaurin expansion of 1 — /3 
is 0.36a 2 ?i 2 , so if the component is sharp (nearly flat in 
the uv plane), the are small, and there is little smear- 
ing, the effect of a visibility on A\ 2 is proportional to 
the fourth power of its baseline length. If the component 
is resolved V m ,i attenuates the importance of the outer 
baselines. Their special status can also be removed by a 
large smearing beam, since 1 — j3 saturates at 1. 

Note that this leverage is simply the relative impor- 
tance of the visibility. Since (5 is a function of the product 
of a 3 and Ui, a s is inversely proportional to an average 
baseline length, as one would expect, but the average is 
weighted towards the outer regions of where there is sig- 
nificant measured amplitude in the uv plane. 
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